Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic Detection of the Prosodic Structures of Speech Utterances

Identifieur interne : 001634 ( Main/Exploration ); précédent : 001633; suivant : 001635

Automatic Detection of the Prosodic Structures of Speech Utterances

Auteurs : Katarina Bartkova [France] ; Denis Jouvet [France]

Source :

RBID : ISTEX:D4C24465BE23C39366C4630171E0613FEEEE7216

Abstract

Abstract: This paper presents an automatic approach for the detection of the prosodic structures of speech utterances. The algorithm relies on a hierarchical representation of the prosodic organization of the speech utterances. The approach is applied on a corpus of radio French broadcast news and also on radio and TV shows which are more spontaneous speech data. The algorithm detects prosodic boundaries whether they are followed or not by pause. The detection of the prosodic boundaries and of the prosodic structures is based on an approach that integrates little linguistic knowledge and mainly uses the amplitude of the F0 slopes and the inversion of the slopes as described in [1], as well as phone durations. The automatic prosodic segmentation results are then compared to a manual prosodic segmentation made by an expert phonetician. Finally, the results obtained by this automatic approach provide an insight into the most frequently used prosodic structures in the broadcasting speech style as well as in a more spontaneous speech style.

Url:
DOI: 10.1007/978-3-319-01931-4_1


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic Detection of the Prosodic Structures of Speech Utterances</title>
<author>
<name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
</author>
<author>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D4C24465BE23C39366C4630171E0613FEEEE7216</idno>
<date when="2013" year="2013">2013</date>
<idno type="doi">10.1007/978-3-319-01931-4_1</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-8P0RP7B1-G/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003270</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003270</idno>
<idno type="wicri:Area/Istex/Curation">003229</idno>
<idno type="wicri:Area/Istex/Checkpoint">000260</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000260</idno>
<idno type="wicri:doubleKey">0302-9743:2013:Bartkova K:automatic:detection:of</idno>
<idno type="wicri:Area/Main/Merge">001646</idno>
<idno type="wicri:Area/Main/Curation">001634</idno>
<idno type="wicri:Area/Main/Exploration">001634</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Automatic Detection of the Prosodic Structures of Speech Utterances</title>
<author>
<name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>ATILF - Analyse et Traitement Informatique de la Langue Franaise, 44 Av De La Libration, BP 30687, 54063, Nancy Cedex</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>Speech Group, LORIA Inria, F-54600, Villers-lès-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Université de Lorraine, LORIA, UMR 7503, F-54600, Villers-lès-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers-lès-Nancy</settlement>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>CNRS, LORIA, UMR 7503, F-54600, Villers-lès-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper presents an automatic approach for the detection of the prosodic structures of speech utterances. The algorithm relies on a hierarchical representation of the prosodic organization of the speech utterances. The approach is applied on a corpus of radio French broadcast news and also on radio and TV shows which are more spontaneous speech data. The algorithm detects prosodic boundaries whether they are followed or not by pause. The detection of the prosodic boundaries and of the prosodic structures is based on an approach that integrates little linguistic knowledge and mainly uses the amplitude of the F0 slopes and the inversion of the slopes as described in [1], as well as phone durations. The automatic prosodic segmentation results are then compared to a manual prosodic segmentation made by an expert phonetician. Finally, the results obtained by this automatic approach provide an insight into the most frequently used prosodic structures in the broadcasting speech style as well as in a more spontaneous speech style.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Nancy</li>
<li>Villers-lès-Nancy</li>
</settlement>
<orgName>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Grand Est">
<name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
</region>
<name sortKey="Bartkova, Katarina" sort="Bartkova, Katarina" uniqKey="Bartkova K" first="Katarina" last="Bartkova">Katarina Bartkova</name>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
<name sortKey="Jouvet, Denis" sort="Jouvet, Denis" uniqKey="Jouvet D" first="Denis" last="Jouvet">Denis Jouvet</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001634 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001634 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:D4C24465BE23C39366C4630171E0613FEEEE7216
   |texte=   Automatic Detection of the Prosodic Structures of Speech Utterances
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022